Probabilistic Classi cation of Monophonic Instrument Playing Techniques
نویسندگان
چکیده
1. Introduction Understanding the underlying intentions of a music performer is crucial to enable a machine such as an automated accompaniment system to interact intelligently with a musician. Particularly, understanding the symbol associated with a tone the player generates allows a machine to create response that is in concordance with the symbol. We dene playing technique as a symbol such as the expression marking that reects the intention of a human performer in a per-ceivable change in timbre. We believe that it is crucial to understand that some playing techniques are inherently ambiguous, and to associate with an input signal the degree of ambiguity along with the estimated class labels. For example, if the class of dynamics are described by soft and loud, it is irrelevant to ask whether a moderate sound is soft or loud-it only makes sense to say that it either belongs to both or to neither. We shall express ambiguity by modeling a set of playing techniques as a posterior distribution, and using statistics obtained from the distribution to determine the ambiguity. We shall group playing techniques that acts on a same, continuous quality into one set that in turn generates a posterior distribution as shwon in Figure 1. Particularly, we hypothesize that ambiguous sounds are the main cause of misclas-sication, and such sounds creates a distribution with a high variance. Existing research in detecting the playing technique involved discretization of techniques involving a continuous factor and converting it into a problem of classication. For example, the position of the bow on a violin was discretized by recording two points and choosing one of the two positions [1]. Other research exclusively dealt with playing techniques of discrete quality such as whether a bass guitar string was slapped or not [2]. Both approaches did not associate any posterior probability with the output label, and thus suffered when recognizing notes that even humans have trouble distinguishing [1]. Another approach involved extraction of perceptual features that were used to control another musical instrument [3]. This approach did not attempt to symbolize timbre into playing techniques. In this paper, we modeled a set of playing techniques that acts on a same continuous factor (position of the bow on a violin) as a posterior distribution given the input signal using a hybrid of Gaussian Mixtures (GMM) and Relevance Vector Machine (RVM). We then rejected data whose variance exceeds a threshold given …
منابع مشابه
Probabilistic Disease Classi cation of Expression-Dependent Proteomic Data from Mass Spectrometry of Human Serum
We have developed an algorithm called Q5 for probabilistic classi cation of healthy versus disease whole serum samples using mass spectrometry. The algorithm employs principal components analysis (PCA) followed by linear discriminant analysis (LDA) on whole spectrum surface-enhanced laser desorption/ionization time of ight (SELDI-TOF) mass spectrometry (MS) data and is demonstrated on four r...
متن کاملRough Set Based Automatic Classification of Musical Instrument Sounds
Automatic classi cation of audio data arose increasing interest recently. This paper addresses the problem of automatic recognition of musical instrument sounds, applying rough set based techniques as a tool of classi cation. Instruments representing wind and string families were used in the experiments. Since the main problem in case of audio data is the proper parameterization, we also invest...
متن کاملCombining weak learners with probabilistic models for binary classication
In this project it is addressed the problem of binary classi cation i.e. we want to nd a model F : X! f0; 1g for the dependence of a class c on x from a set of samples D = fct;xtgt=1 with ct 2 f0; 1g. The approach is to combine multiple base learners with the use of probabilistic models. In particular, I used the methodology presented by Ghersen eld for doing regression in complex time series[...
متن کاملUsing the Fisher kernel method for Web audio classification
As the multimedia content of the Web increases techniques to automatically classify this content become more important. In this paper we present a system to classify audio les collected from the Web. The system classi es any audio le as belonging to one of three categories: `speech', `music' and `other'. To classify the audio les, we use the technique of Fisher kernels. The technique as propose...
متن کاملSupervised Classi cation a Probabilistic Approach
In this tutorial some classical tools of data analysis and others less well known are surveyed in detail that can be applied to supervised classi cation and in particular to learning with neural net works
متن کامل